Pattern Discovery for Multiple Data Sources Based on Item Rank
نویسندگان
چکیده
Retail company’s data may be geographically spread in different locations due to huge amount of data and rapid growth in transactions. But for decision making, knowledge workers need integrated data of all sites. Therefore the main challenge is to get generalized patterns or knowledge from the transactional data which is spread at various locations. Transporting data from those locations to server site increases the cost of transportation of data and at the same time finding patterns from huge data on the server increases the time and space complexity. Thus multi-database mining plays a vital role to extract knowledge from different data sources. Thus the technique proposed finds the patterns on various sites and instead of transporting the data, only the patterns from various locations get transported to the server to find final deliverable pattern. The technique uses the ranking algorithm to rank the items based on their profit, date of expiry and stock available at each location. Then association rule mining (ARM) is used to extract patterns based on ranking of items. Finally all the patterns discovered from various locations are merged using pattern merger algorithm. Proposed algorithm is implemented and experimental results are taken for both classical association rule mining on integrated data and for datasets at various sources. Finally all patterns are combined to discover actionable patterns using pattern merger algorithm given in section V.
منابع مشابه
Group Pattern Discovery Systems for Multiple Data Sources
INTRODUCTION Multiple data source mining is the process of identifying potentially useful patterns from different data sources, or datasets (Zhang et al., 2003). Group pattern discovery systems for mining different data sources are based on local pattern-analysis strategy, mainly including logical systems for information enhancing, a pattern discovery system, and a post-pattern-analysis system.
متن کاملUsing Multiple-Variable Matching to Identify EFL Ecological Sources of Differential Item Functioning
Context is a vague notion with numerous building blocks making language test scores inferences quite convoluted. This study has made use of a model of item responding that has striven to theorize the contextual infrastructure of differential item functioning (DIF) research and help specify the sources of DIF. Two steps were taken in this research: first, to identify DIF by gender grouping via l...
متن کاملMethods for the Efficient Discovery of Large Item-Indexable Sequential Patterns
An increasingly relevant set of tasks, such as the discovery of biclusters with order-preserving properties, can be mapped as a sequential pattern mining problem on data with item-indexable properties. An item-indexable database, typically observed in biomedical domains, does not allow item repetitions per sequence and is commonly dense. Although multiple methods have been proposed for the effi...
متن کاملDomain-Aware Multi-Truth Discovery from Conflicting Sources
In the Big Data era, truth discovery has served as a promising technique to solve conflicts in the facts provided by numerous data sources. The most significant challenge for this task is to estimate source reliability and select the answers supported by high quality sources. However, existing works assume that one data source has the same reliability on any kinds of entity, ignoring the possib...
متن کاملDetermination of the Parameters of Six Multiple Choice Tests of Mashhad University of Medical Sciences (1389-90) based on Item-Response Theory (IRT)
Background: According to the industrialization of countries and development of societies, tests and methods are required to employ people in industries and organizations and make the best selection in getting workforce. Interviews, Written tests , and multiple choice tests are common methods used in employing people. Among these methods , multiple choice tests is the easiest one because of th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017